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The present invention relates generally to proteinase inhibitors, a precursor thereof and to genetic sequences encoding same. More 
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to a sequence which encodes a type II serin proteinase inhibitor (PI) precursor from a plant wherein said precursor comprises at least three 
PI monomers and wherein at least one of said monomers has a chymotrypsin specific site and at least one other of said monomers has a 
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A PROTEINASE INHIBITOR, PRECURSOR THEREOF AND GENETIC 
SEQUENCES ENCODING SAME 

5 The present invention relates generally to proteinase inhibitors, a precursor thereof 
and to genetic sequences encoding same. 

Nucleotide and amino acid sequences are referred to herein by sequence identity 
numbers (SEQ ID NOs) which are defined after the bibliography. A general 
10 summary of the SEQ ID NOs is provided before the examples. 

Throughout this specification and the claims which follow, unless the context requires 
otherwise, the word "comprise", or variations such as "comprises" or "comprising", will 
be understood to imply the inclusion of a stated element or integer or group of 
15 elements or integers but not the exclusion of any other element or integer or group 
of elements or integers. 

Several members of the families Solanaceae and Fabaceae accumulate serine 
proteinase inhibitors in their storage organs and in leaves in response to wounding 

20 (Brown and Ryan, 1984; Richardson, 1977). The inhibitory activities of these 
proteins are directed against a wide range of proteinases of microbial and animal 
origin, but rarely against plant proteinases (Richardson, 1977). It is believed that 
these inhibitors are involved in protection of the plants against pathogens and 
predators. In potato tubers and legume seeds, the inhibitors can comprise 10% or 

25 more of the stored proteins (Richardson, 1977), while in leaves of tomato and potato 
(Green and Ryan, 1972), and alfalfa (Brown and Ryan, 1984), proteinase inhibitors 
can accumulate to levels of 2% of the soluble protein within 48 hours of insect 
attack, or other types of wounding (Brown & Ryan, 1984; Graham et dL, 1986). 
High levels of these inhibitors (up to 50% of total soluble protein) are also present 

30 in unripe fruits of the wild tomato, Lycopersicon peruvianum (Pearce et dL y 1988). 
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There are two families of serine proteinase inhibitors in tomato and potato (Ryan, 
1984). Type I inhibitors are small proteins (monomer Mr 8100) which inhibit 
chymotrypsin at a single reactive site (Melville and Ryan, 1970; Plunkett et al, 
5 1982). Inhibitors of the type II family generally contain two reactive sites, one of 
which inhibits chymotrypsin and the other trypsin (Bryant et al, 1976; Plunkett et 
al, 1982). The type U inhibitors have a monomer Mr of 12,300 (Plunkett et al, 
1982). Proteinase inhibitor I accumulates in etiolated tobacco (Nicotiana tabacum) 
leaves (Kuo et al, 1984), and elicitors from Phytophthora pamsUicovax. nkotionoe 
10 were found to induce proteinase inhibitor I accumulation in tobacco cell suspension 
cultures (Rickauer et al, 1989). 

There is a need to identify other proteinase inhibitors and to investigate their 
potential use in the development of transgenic plants with enhanced protection 

15 against pathogens and predators. In accordance with the present invention, genetic 
sequences encoding a proteinase inhibitor precursor have been cloned. The 
precursor has multi-proteinase inhibitor domains and will be useful in developing a 
range of transgenic plants with enhanced proteinase inhibitor expression. Such 
plants will have enhanced protective properties against pathogens and predators. 

20 The genetic constructs of the present invention will also be useful in developing 
vaccines for ingestion by insects which are themselves predators or which act as hosts 
for plant pathogens. The recombinant precursor or monomelic inhibitors will also 
be useful in topical sprays and in assisting animals in feed digestion. 

25 Accordingly, one aspect of the present invention relates to a nucleic acid molecule 
comprising a sequence of nucleotides which encodes or is complementary to a 
sequence which encodes a type II serine proteinase inhibitor (PI) precursor from a 
plant wherein said precursor comprises at least three PI monomers and wherein at 
least one of said monomers has a chymotrypsin specific site and at least one other 

30 of said monomers has a trypsin specific site. 
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The "nucleic acid molecule" of the present invention may be RNA or DNA (eg 
cDNA), single or double stranded and linear or covalently closed. The nucleic acid 
molecule may also be genomic DNA corresponding to the entire gene or a 
5 substantial portion thereof or to fragments or derivatives thereof. The nucleotide 
sequence may correspond to the naturally occurring nucleotide sequence of the 
genomic or cDNA clone or may contain single or multiple nucleotide substitutions, 
deletions and/or additions thereto. All such variants in the nucleic acid molecule 
either retain the ability to encode at least one monomer or active part thereof or are 
10 useful as hybridisation probes or polymerase chain reaction (PGR) primers for the 
same or similar genetic sequences in other sources. 

Preferably, the PI precursor comprises at least four, more preferably at least five and 
even more preferably at least six PI monomers. Still more preferably, the PI 
15 precursor further comprises a signal sequence. The PI precursor of the present 
invention preferably comprises amino acid sequences which are process sites for 
cleavage into individual monomers. 

The term "precursor" as used herein is not intended to place any limitation on the 
20 utility of the precursor molecule itself or a requirement that the molecule first be 
processed into monomers before PI activity is expressed. The precursor molecule 
has PI activity and the present invention is directed to the precursor and to the 
individual monomers of the precursor. 

25 Furthermore, the present invention extends to a nucleic acid molecule comprising 
a sequence of nucleotides which encodes or is complementary to a sequence which 
encodes a hybrid type II serine PI precursor wherein said precursor comprises at 
least two monomers from different Pis. The at least two monomers may be modified 
such as being unable to be processed into individual monomers or may retain the 

30 ability to be so processed. Preferably, at least one of said monomers has a 
chymotrypsin specific site and the other of said monomers has a trypsin specific site. 
Preferably there are at least three monomers, more preferably at least four 
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monomers, still more preferably at least five monomers and yet still more preferably 
at least six monomers wherein at least two are from different Pis. In a most 
preferred embodiment, at least one of said monomers is a thionin. Such hybrid PI 
precursors and/or monomers thereof are particularly useful in generating molecules 
5 which are "multi-valent" in that they are active against a range of pathogens and 
predators such as both fungi and insects. Accordingly, reference herein to "PI 
precursor" includes reference to hybrid molecules. 

The present invention is exemplified by the isolation of the subject nucleic acid 
10 molecule from Nicotiana aia/awhich has the following nucleotide sequence (SEQ ID 
NO. 1) and a corresponding amino acid sequence (SEQ ID NO. 3): 



15 
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Lys 


GCT 
Ala 


TGT 
Cys 


ACC 
Thr 


TTA 
Leu 


AAC 
Asn 


TGT 
Cys 


GAT 
Asp 


CCA 
Pro 


AGA 
Arg 


ATT 
He 


GCC 
Ala 


TAT 
Tyr 


GGA 
Gly 


GTT 
Val 


TGC 
Cys 


CCG 
Pro 


CGT 
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TCA 
Ser 
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Glu 


GAA 
Glu 


AAG 
Lys 


20 


AAG 
Lys 


AAT 
Asn 


GAT 
Asp 


CGG 
Arg 


ATA 
He 


TGC 
Cys 


ACC 
Thr 


AAC 
Asn 


TGT 
Cys 


TGC 
Cys 
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Ala 


GGC 
Gly 


ACG 
Thr 


AAG 
Lys 


GGT 
Gly 


TGT 
Cys 




AAG 
Lys 


TAC 
Tyr 


TTC 
Phe 


AGT 
Ser 


GAT 
Asp 


GAT 
Asp 


GGA 
Gly 


ACT 
Thr 


TTT 
Phe 


GTT 
Val 


TGT 
Cys 


GAA 
Glu 


GGA 
Gly 


GAG 
Glu 


TCT 
Ser 


GAT 
Asp 


25 


CCT 
Pro 


AGA 
Arg 


AAT 
Asn 


CCA 
Pro 


AAG 
Lys 


GCT 
Ala 


TGT 
Cys 


ACC 
Thr 


TTA 
Leu 


AAC 
Asn 


TGT 

Cys 


GAT 
Asp 


CCA 
Pro 


AGA 
Arg 


ATT 
He 


GCC 
Ala 


30 


TAT 
Tyr 


GGA 
Gly 


GTT 
Val 


TGC 
Cys 
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Pro 
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Arg 
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GAA 
Glu 
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Lys 
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Thr 


AAG 
Lys 


GGT 
Gly 


TGT 
Cys 


AAG 
Lys 


TAC 
Tyr 


TTC 
Phe 


AGT 
Ser 


GAT 
Asp 


GAT 
Asp 


35 


GGA 
Gly 


ACT 
Thr 


TTT 
Phe 


GTT 
Val 


TGT 
Cys 
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Glu 
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Gly 


GAG 
Glu 
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Ser 
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Pro 
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GCT 
Ala 
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Cys 
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Asp 
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Pro 
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Arg 
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He 
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TAT 
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He 
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Cys 
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Pro 


CTT 
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GCA GAA GAA AAG AAG AAT GAT CGG ATA TGC ACC AAC TGT TGC GCA GGC 
Ala Glu GLu Lys Lys Asn Asp Arg lie Cys Thr Asn Cys Cys Ala Gly 

5 AAA AAG GGT TGT AAG TAC TTT AGT GAT GAT GGA ACT TTT GTT TGT GAA 
Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu 

GGA GAG TCT GAT CCT AAA AAT CCA AAG GCC TGT CCT CGG AAT TGT GAT 
Gly Glu Ser Asp Pro Lys Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp 

10 

GGA AGA ATT GCC TAT GGG ATT TGC CCA CTT TCA GAA GAA AAG AAG AAT 
Gly Arg He Ala Tyr Gly He Cys Pro Leu Ser Glu Glu Lys Lys Asn 

GAT CGG ATA TGC ACC AAC TGC TGC GCA GGC AAA AAG GGT TGT AAG TAC 
15 Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 

TTT AGT GAT GAT GGA ACT TTT GTT TGT GAA GGA GAG TCT GAT CCT AAA 
Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys 

20 AAT CCA AAG GCT TGT CCT CGG AAT TGT GAT GGA AGA ATT GCC TAT GGG 
Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly 

ATT TGC CCA CTT TCA GAA GAA AAG AAG AAT GAT CGG ATA TGC ACA AAC 
He Cys Pro Leu Ser Glu Glu Lys Lys Asn Asp Arg He Cys Thr Asn 

25 

TGT TGC GCA GGC AAA AAG GGC TGT AAG TAC TTT AGT GAT GAT GGA ACT 
Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr 

TTT GTT TGT GAA GGA GAG TCT GAT CCT AGA AAT CCA AAG GCC TGT CCT 
30 Phe Val Cys Glu Gly Glu Ser Asp Pro Arg Asn Pro Lys Ala Cys Pro 

CGG AAT TGT GAT GGA AGA ATT GCC TAT GGA ATT TCC CCA CTT TCA GAA 
Arg Asn Cys Asp Gly Arg He Ala Tyr Gly He Cys Pro Leu Ser Glu 

35 GAA AAG AAG AAT GAT CGG ATA TGC ACC AAT TGT TGC GCA GGC AAG AAG 
Glu Lys Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys 

GGC TGT AAG TAC TTT AGT GAT GAT GGA ACT TTT ATT TGT GAA GGA GAA 
Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe He Cys Glu Gly Glu 

40 

TCT GAA TAT GCC AGC AAA GTG GAT GAA TAT GTT GGT GAA GTG GAG AAT 
Ser Glu Tyr Ala Ser Lys Val Asp Glu Tyr Val Gly Glu Val Glu Asn 

GAT CTC CAG AAG TCT AAG GTT GCT GTT TCC 
45 Asp Leu Gin Lys Ser Lys Val Ala Val Ser 
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This is done, however, with the understanding that the present invention extends to 
an equivalent or substantially similar nucleic acid molecule from any other plant. 
By "equivalent" and "substantially similar" is meant at the level of nucleotide 
sequence, amino acid sequence, antibody reactivity, monomer composition and/or 
5 processing of the precursor to produce monomers. For example, a nucleotide 
sequence having a percentage sequence similarity of at least 55%, such as about 60- 
65%, 70-75%, 80-85% and over 90% when compared to the sequence of SEQ ID 
NO. 1 would be considered "substantially similar" to the subject nucleic acid 
molecule provided that such a substantially similar sequence encodes a PI precursor 
having at least three monomers and preferably four, five or six monomers as 
hereinbefore described. 



10 



In a particularly preferred embodiment, the nucleic acid molecule further encodes 
a signal sequence 5' to the open reading frame and/or a nucleotide sequence 3' of 
15 the coding region providing a full nucleotide sequence as follows (SEQ ID NO. 2): 

CGAGTAAGTA TGGCTGTTCA CAGAGTTAGT TTCCTTGCTC TCCTCCTCTT ATTTGGAATG 
TCTCTGCTTG TAAGCAATGT GGAACATGCA GATC 

20 





TGT 
Cys 


GAT 
Asp 


CCA 
Pro 


AGA 
Arg 


ATT 
He 


GCC 
Ala 


TAT 
Tyr 


GGA 
Gly 


GTT 
Val 


TGC 
Cys 


25 


AAG 
Lys 


AAT 
Asn 


GAT 
Asp 


CGG 
Arg 


ATA 

He 


TGC 
Cys 


ACC 
Thr 


AAC 
Asn 


TGT 
Cys 


TGC 
Cys 


30 


AAG 

Lys 


TAC 
Tyr 


TTC 
Phe 


AGT 
Ser 


GAT 
Asp 


GAT 
Asp 


GGA 
Gly 


ACT 
Thr 


TTT 
Phe 


GTT 
Val 




CCT 
Pro 


AGA 
Arg 


AAT 
Asn 


CCA 
Pro 


AAG 
Lys 


GCT 
Ala 


TGT 
Cys 


ACC 
Thr 


TTA 
Leu 


AAC 
Asn 


35 


TAT 
Tyr 


CCA 
Gly 


GTT 
Val 


TGC 
Cys 


CCG 
Pro 


CGT 
Arg 


TCA 
Ser 


GAA 
Glu 


GAA 
Glu 


AAG 

Lys 




ACC 
Thr 


AAC 

Asn 


TGT 
Cys 


TGC 
Cys 


GCA 
Ala 


GGC 
Gly 


ACG 
Thr 


AAG 
Lys 


GGT 
Gly 


TGT 
Cys 



AAG 


GCT 


TGT 


ACC 


TTA 


AAC 


Lys 


Ala 


Cys 


Thr 


Leu 


Asn 


CCG 


CGT 


TCA 


GAA 


GAA 


AAG 


Pro 


Arg 


Ser 


Glu 


Glu 


Lys 


GCA 


GGC 


ACG 


AAG 


GGT 


TGT 


Ala 


Gly 


Thr 


Lys 


Gly 


Cys 


TGT 


GAA 


GGA 


GAG 


TCT 


GAT 


Cys 


Glu 


Gly 


Glu 


Ser 


Asp 


TGT 


GAT 


CCA 


AGA 


ATT 


GCC 


Cys 


Asp 


Pro 


Arg 


He 


Ala 


AAC 


AAT 


GAT 


CGG 


ATA 


TGC 


Lys 


Asn 


Asp 


Arg 


He 


Cys 


AAG 


TAC 


TTC 


AGT 


GAT 


GAT 


Lys 


Tyr 


Phe 


Ser 


Asp 


Asp 
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GGA ACT TTT GTT TGT GAA GGA GAG TCT GAT CCT AGA AAT CCA AAG GCT 
Gly Thr Phe Val Cys Glu GLy Glu Ser Asp Pro Arg Asn Pro Lys Ala 

TCT CCT CGG AAT TGC GAT CCA AGA ATT GCC TAT GGG ATT TGC CCA CTT 
5 Cys Pro Arg Asn Cys Asp Pro Arg lie Ala Tyr Gly lie Cys Pro Leu 

GCA GAA GAA AAG AAG AAT GAT CGG ATA TGC ACC AAC TGT TGC GCA GGC 
Ala Glu Glu Lys Lys Asn Asp Arg lie Cys Thr Asn Cys Cys Ala Gly 

10 AAA AAG GGT TGT AAG TAC TTT AGT GAT GAT GGA ACT TTT GTT TGT GAA 
Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu 

GGA GAG TCT GAT CCT AAA AAT CCA AAG GCC TGT CCT CGG AAT TGT GAT 

Gly Glu Ser Asp Pro Lys Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp 

15 

GGA AGA ATT GCC TAT GGG ATT TGC CCA CTT TCA GAA GAA AAG AAG AAT 

Gly Arg He Ala Tyr Gly He Cys Pro Leu Ser Glu Glu Lys Lys Asn 

GAT CGG ATA TGC ACC AAC TGC TGC GCA GGC AAA AAG GGT TGT AAG TAC 
20 Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 

TTT AGT GAT GAT GGA ACT TTT GTT TGT GAA GGA GAG TCT GAT CCT AAA 
Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys 

25 AAT CCA AAG GCT TGT CCT CGG AAT TGT GAT GGA AGA ATT GCC TAT GGG 
Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly 

ATT TGC CCA CTT TCA GAA GAA AAG AAG AAT GAT CGG ATA TGC ACA AAC 
He Cys Pro Leu Ser Glu Glu Lys Lys Asn Asp Arg He Cys Thr Asn 

30 

TGT TGC GCA GGC AAA AAG GGC TGT AAG TAC TTT AGT GAT GAT GGA ACT 
Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr 

TTT GTT TGT GAA GGA GAG TCT GAT CCT AGA AAT CCA AAG GCC TGT CCT 
35 Phe Val Cys Glu Gly Glu Ser Asp Pro Arg Asn Pro Lys Ala Cys Pro 

CGG AAT TGT GAT GGA AGA ATT GCC TAT GGA ATT TGC CCA CTT TCA GAA 
Arg Asn Cys Asp Gly Arg He Ala Tyr Gly He Cys Pro Leu Ser Glu 

40 GAA AAG AAG AAT GAT CGG ATA TGC ACC AAT TGT TGC GCA GGC AAG AAG 
Glu Lys Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys 

GGC TGT AAG TAC TTT AGT GAT GAT GGA ACT TTT ATT TGT GAA GGA GAA 
Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe He Cys Glu Gly Glu 

45 
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TCT GAA TAT GCC AGC AAA GTG GAT GAA TAT GTT GGT GAA GTG CAG AAT 
Ser Glu Tyr Ala Ser Lys Val Asp Glu Tyr Val Gly Glu Val Glu Asn 

GAT CTC CAG AAG TCT AAG GTT GCT GTT TCC TAAGTCCTAA CT AAT AAT AT 
5 Asp Lea Gin Lys Ser Lys Val Ala Val Ser 

GTAGTCTATG TATGAAACAA AGG CATC CCA ATATCCTCTG TCTTGCCTGT AATCTGTAAT 

ATGCTAGTGG AGCTTTTCCA CTGCCTGTTT AATAAGAAAT GGACCACTAC TTTGTTTTAG 

10 

TTAAAAAAAA AAAAAAAAAA 

including substantially similar variants thereof. 

1 5 Accordingly, a preferred embodiment of the present invention provides a nucleic acid 
molecule comprising a sequence of nucleotides as set forth in SEQ ID NO. 1 or 2 
which encodes or is complementary to a sequence which encodes a type II serine PI 
precursor from Nicotiana alata or having at least 55% similarity to said precursor or 
at least one domain therein wherein said precursor comprises a signal peptide and 

20 at least five monomers and wherein one of said monomers has a chymotrypsin 
specific site and four of said monomers have trypsin specific sites. 

In still a more preferred embodiment, the nucleic acid molecule is a cDNA molecule 
and comprises a nucleotide sequence generally as set forth in SEQ ID NO. 1 or 2 
25 or being substantially similar thereto as hereinbefore defined to the whole of said 
sequence or to a domain thereof. 

Another aspect of the present invention is directed to a nucleic acid molecule 
comprising a sequence of nucleotides which encodes or is complementary to a 
30 sequence which encodes a single type II serine PI having either a chymotrypsin 
specific site or a trypsin specific site and wherein said PI is a monomer of a 
precursor PI having at least three monomers of which at least one of said monomers 
has a chymotrypsin site and the other of said monomers has a trypsin site. Preferably, 
however, the precursor has four, five or six monomers and is as hereinbefore defined. 

35 
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In its most preferred embodiment, the plant is N. data (Link et Otto) having self- 
incompatibility genotype S^, S 3 S 3 or S 6 S 6 , and the nucleic acid molecule is isolatable 
from or complementary to genetic sequences isolatable from stigmas and styles of 
5 mature plants. The corresponding mRNA is approximately 1 .4 kb and the cDNA has 
six conserved domains wherein the first two domains are 100% identical and contain 
chymotrypsin-specific sites (Leu-Asn). The third, fourth and fifth domains share 95- 
98% identity and have sites specific for trypsin (Arg-Asn). A sixth domain ^hich 
also has a trypsin specific site has less identity to the third, fourth and fifth domains 
10 (79-90%) due mainly to a divergent 3* sequence (see Table 1). The preferred PI 
inhibitor of the present invention has a molecular weight of approximately 42-45 kDa 
with an approximately 29 amino acid signal sequence. 

The N-terminal sequence of the monomeric PI i< represented in each of the six 
15 repeated domains in the predicted sequence of the PI precursor protein. Thus, it is 
likely that the PI precursor protein is cleaved at six sites to produce seven peptides. 
Six of the seven peptides, peptides 2, 3, 4, 5, 6 and 7 (Figure 1, residues 25-82 [SEQ 
ID NO. 5], 83-140 [SEQ ID NO. 6], 141-198 [SEQ ID NO. 7], 199-256 [SEQ ED NO. 
8], 257-314 [SEQ ID NO. 9] and 315-368 [SEQ ID NO. 9], respectively), would be 
20 in the same molecular weight range as the monomeric PI (about 6 kDa) and would 
have the same N-terminal sequence. Peptide 7 does not contain a consensus site for 
trypsin or chymotrypsin. Peptide 1 (residues 1-24 [SEQ ID NO. 4], Figure 1) is 
smaller than 6 kD, has a different N-terminus and was not detected in a purified 
monomeric PI preparation. It could be envisaged that peptide 1 and peptide 7 
25 would form a functional proteinase inhibitor with the inhibitory site on peptide 1 
held in the correct conformation by disulphide bonds formed between the two 
peptides. 

Although not intending to limit the present invention to any one hypothesis, the PI 
30 precursor may be processed by a protease responsible, for example, for cleavage of 
an Asn-Asp linkage, to produce the bioactive monomers. More particularly, the 
protease sensitive sequence is Rf X 1 -X 2 -Asn-Asp-R 2 where Rj, X t and X 2 are 
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defined below. The discovery of such a sequence will enable the engineering of 
peptides and polypeptides capable of being processed in a plant by cleavage of the 
protease sensitive sequence. According to this aspect of the present invention there 
is provided a protease sensitive peptide comprising the amino acid sequence: 

5 

"Xj -X 2 - AsrrAsp - 

wherein Xj and X 2 are any amino acid but are preferably both Lys residues. The 
protease sensitive peptide may also be represented as: 

10 

Rj-Xj-Xj-Asn-Asp-Rj 

wherein X x and X 2 are preferably the same and are preferably both Lys residues and 
wherein Rj and R 2 are the same or different, any D or L amino acid, a peptide, a 

15 polypeptide, a protein, or a non-amino acid moiety or molecule such as, but not 
limited to, an alkyl (eg methyl, ethyl), substituted alkyl, alkenyl, substituted alkenyl, 
acyl, dienyl, arylalkyL, arylalkenyl, aryl, substituted aryl, heterocyclic, substituted 
heterocyclic, cycloalkyl, substituted cycloalkyl, halo (e.g. CI, Br, I, F), haloalkyl, nitro, 
hydroxy, thiol, sulfonyl, carboxy, alkoxy, aryloxy and alkylaryloxy group and the like 

20 as would be apparent to one skilled in the art. By alkyl, substitued alkyl, alkenyl and 
substituted alkenyl and the like is meant to encompass straight and branched 
molecules, lower (Cj - C 6 ) and higher (more than C 6 ) derivatives. The term 
"substituted" includes all the substituents set forth above. 

25 In its most preferred embodiment, the protease sensitive peptide is: 

R 1 -X 1 -X 2 -Asn-Asp-R 2 

wherein Rj and R 2 are the same or different and are peptides or polypeptides and 
30 wherein X x and X2 are both Lys residues. 
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Such a protease sensitive peptide can be placed between the same or different 
monomers so that upon expression in a suitable host or m vitro, the larger molecule 
can be processed to the peptides located between the protease sensitive peptides. 

5 

The present invention also extends to a nucleic acid molecule comprising a sequence 
of nucleotides which encodes or is complementary to a sequence which encodes a 
protease sensitive peptide comprising the sequence: 

10 -XfX^Asn-Asp- 

wherein Xj and X 2 are preferably the same and are most preferably both Lys 
residues. Such a nucleic acid molecule may be part of a larger nucleotide sequence 
encoding, for example, a precursor polypeptide capable of being processed ^ia the 
15 protease sensitive sequence into individual peptides or monomers. 

The protease sensitive peptide of the present invention is particularly useful in 
generating poly and/ or multi-valent "precursors" wherein each monomer is the same 
or different and directed to the same or different activities such as anti-viral, ami- 
20 bacterial, antifungal, anti-pathogen and /or anti-predator activity. 

Although not wishing to limit this aspect of the invention to any one hypothesis or 
proposed mechanism of action, it is believed that the protease acts adjacent the Asn 
residue as more particularly between the Asn-Asp residues. 

25 

The present invention extends to an isolated type II serine PI precursor from a plant 
wherein said precursor comprises at least three PI monomers and wherein at least 
one of said monomers has a chymotrypsin specific site and at least one other of said 
monomers has a trypsin specific site. Preferably, the PI precursor has four, five or 
30 six monomers and is encoded by the nucleic acid molecule as hereinbefore described. 
The present invention also extends to the individual monomers comprising the 
precursor. The present invention also extends to a hybrid recombinant PI precursor 
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molecule comprising at least two monomers from different Pis as hereinbefore 
described. 

The isolated PI or PI precursor may be in recombinant form and/or biologically 
pure. By "biologically pure" is meant a preparation of PI, PI precursor and/or any 
mixtures thereof having undergone at least one purification step including 
ammonium sulphate precipitation, Sephadex chromatography and/or affinity 
chromatography. Preferably, the preparation comprises at least 20% of the PI, PI 
precursor or mixture thereof as determined by weight, activity antibody, reactivity 
and/or amino acid content. Even more preferably, the preparation comprises 30- 
40%, 50-60% or at least 80-90% of PI, PI precursor or mixture thereof. 



The PI or its precursor may be naturally occurring or be a variant as encoded by the 
nucleic acid variants referred to above. It may also contain single or multiple 
15 substitutions, deletions and/or additions to its amino acid sequence or to non- 
proteinaceous components such as carbohydrate and/or lipid moieties. 

The recombinant and isolated PI, PI precursor and mixtures thereof are useful as 
laboratory reagents, in the generation of antibodies, in topically applied insecticides 
20 as well as orally ingested insecticides. 

The recombinant PI or PI precursor may be provided as an insecticide alone or in 
combination with one or more carriers or other insecticides such as the BT crystal 
protein. 



The PI of the present invention is considered to have a defensive role in organs of 
the plant, for example, the stigma, against the growth or infection by pests and 
pathogens such as fungi, bacteria and insects. There is a need, therefore, to develop 
genetic constructs which can be used to generate transgenic plants capable of 
expressing the PI precursor where this can be processed into monomers of a 
monomeric PI itself. 
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Accordingly, another aspect of the present invention contemplates a genetic construct 
comprising a nucleic acid molecule comprising a sequence of nucleotides which 
encodes or is complementary to a sequence which encodes a type II serine PI 
precursor or monomer thereof from a plant wherein said precursor comprises at least 
5 three PI monomers and wherein at least one of said monomers has a chymotrypsin 
specific site and at least one of said other monomers has a trypsin specific site and 
said genetic sequence further comprises expression means to permit expression of 
said nucleic acid molecule, replication means to permit replication in a plant cell or, 
alternatively, integration means, to permit stable integration of said nucleic acid 

10 molecule into a plant cell genome. Preferably, the expression is regulated such as 
deveiopmentally or in response to infection such as being regulated by an existing 
PI regulatory sequence. Preferably, the expression of the nucleic acid molecule is 
enhanced to thereby provide greater endogenous levels of PI relative to the levels 
in the naturally occurring plant. Alternatively, the PI precursor cDNA of the present 

15 invention is usable to obtain a promoter sequence which can then be used in the 
genetic construct or to cause its manipulation to thereby permit over-expression of 
the equivalent endogenous promoter. In another embodiment the PI precursor is a 
hybrid molecule as hereinbefore described 

20 Yet another aspect of the present invention is directed to a transgenic plant carrying 
the genetic sequence and/or nucleic acid molecule as hereinbefore described and 
capable of producing elevated, enhanced or more rapidly produced levels of PI 
and/ or PI precursor or hybrid PI precursor when required- Preferably, the plant is 
a crop plant or a tobacco plant but other plants are usable where the PI or PI 

25 precursor nucleic acid molecule is expressable in said plant. Where the transgenic 
plant produces PI precursor, the plant may or may not further process the precursor 
into monomers. Alternatively, the genetic sequence may be part of a viral or 
bacterial vector for transmission to an insect to thereby control pathogens in insects 
which would consequendy interrupt the transmission of the pathogens to plants. 

30 
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In still yet another aspect of the present invention, there is provided antibodies to 
the PI precursor or one or more of its monomers. Antibodies may be monoclonal 
or polyclonal and are useful in screening for PI or PI precursor clones in an 
5 expression library or for purifying PI or PI precursor in a fermentation fluid, 
supernatant fluid or plant extract. 

The genetic constructs of the present invention can also be used to populate the gut 
of insects to act against the insect itself or any plant pathogens therein or to 
10 incorporate into the gut of animals to facilitate the digestion of plant material. 

The present invention is further described by reference to the following non-limiting 
Figures and Examples. 

15 In the Figures: 

Figure 1 shows the nucleic acid sequence (SEQ ID NCX 2) of the pNA-PI-2 insert 
and the corresponding amino acid sequence (SEQ ID NO. 3) of the N. alata PI 
protein. The amino acid sequence is numbered beginning with 1 for the first amino 

20 acid of the mature protein. The signal sequence is encoded by nucleotides 1 to 97 
and the amino acid residues have been assigned negative numbers. The reactive site 
residues of the inhibitor are boxed. The N. alata PI sequence contains six similar 
domains (domain 1, residues 1 to 58, domain 2, residues 59-116, domain 3, residues 
117-174, domain 4, residues 175-232, domain 5, residues 233-290 and domain 6, 

25 residues 291-343). 

Figure 2 is a photographic representation showing a gel blot analysis of RNA from 
various organs of N. alata Gel Blot of RNA isolated from organs of N. alata and 
from stigmas and styles of M tabacum and AT. sylvestris, hybridised with the cDNA 
30 clone NA-PI-2. St, stigma and style; Ov, ovaries; Po, pollen; Pe, petals; Se, 
sepals; L, non-wounded leaves; L4, leaves 4h after wounding; L24, leaves 24h after 
wounding; Nt, N. tabacum stigma and style; Ns, N. sylvestris stigma and style; Na 
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Hvrilll restriction fragments of Lambda-DNA. 

The NA-PI-2 clone hybridised to 2 mKNA species (1.0 and 1.4kb). The larger 
mRNA was predominant in stigma and styles, whereas the smaller mRNA species 
5 was more dominant in other tissues. After high stringency washes, the l.Okb mRNA 
from stigma and style no longer hybridises to the NA-PI-2 probe. 

Figure 3 is a photographic presentation depicting in situ localisation of RNA 
homologous to NA-PI-2 in stigma and style. 

(a) Autoradiograph of a longitudinal cryosection through the stigma and style of 
a 1cm long bud after hybridisation with the ^P-labelled NA-PI-2 cDNA 
probe. 

(b) The same section as (a), stained with toluidine blue, c, cortex; v, vascular 
bundles; tt, transmitting tract; s, stigmatic tissue. 

The cDNA probe labelled the cells of the stigma heavily and some hybridisation to 
the vascular bundles can be seen. There was no hybridisation to the epidermis, 
cortical tissue or transmitting tissue. Scale bars = 200 ym. 

20 Figure 4 is a photographic representation of a gel blot analysis of genomic DNA of 
N. alata Gel blot analysis of M alata genomic DNA digested with the restriction 
enzymes EcdRl or Hin&IU, and probed with radiolabeled NA-PI-2. Size markers 
(kb) are Hindlll restriction fragments of Lambda-DNA. 

25 £ccRI produced two hybridising fragments (llkb and 7.8kb), while ffirrilll gave 
three large hybridising fragments (16.6, 13.5 and 10.5kb). The NA-PI-2 clone 
appears to belong to a small multigene family consisting of at least two members. 

Figure 5 is a graphic representation of PI activity in various organs of N. alata 
30 Buffer soluble extracts from various organs were tested for their ability to inhibit 
trypsin and chymotrypsin. Stigma and sepal extracts were the most effective 
inhibitors of both trypsin (A) and chymotrypsin (B). 



10 



15 
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Figure 6 depicts the steps of the purification of PI from N. alata stigmas. 

(a) Sephadex G-50 gel filtration chromatography of ammonium sulphate 

precipitated proteins from stigma extracts. The PI activity eluted late in the 

profile. 

5 (b) 20% w/v SDS-polyacrylamide gel (Laemmli, 1970) of fractions across the gel 
filtration column. The gel was silver stained and molecular weight markers 
(Pharmacia peptide markers) are in kilodaltons. A protein of about 6kD 
(arrowed) coelutes with the proteinase inhibitor activity, 
(c) Analysis of Pi-containing fractions at different stages of the purification 
10 procedure, by SDS-PAGE. Lane 1, crude stigma extract (5pg); Lane 2, 

stigma proteins precipitated by 80% w/v ammonium sulphate (5pg); Lane 
3, PI protein eluted from the chymotrypsin affinity column (lyg). 

The PI is a 6kD protein and is a major component in unfractionated buffer soluble 
15 extracts from stigmas. 

Figure 7 is a graphical representation showing hydropathy plots of the PI proteins 
encoded by the NA-PI-2 clone from N. alata and the potato and tomato PI II 
cDNAs. Values above the line denote hydrophobic regions and values below the 
20 line denote hydrophilic regions. The putative signal peptides are shaded. The 
hydrophobicity profile was generated using the predictive rules of Kyte and Doolittle 
(1982) and a span of 9 consecutive amino acids. 

(a) Hydropathy profile of the JV. data PI protein. The six repeated domains in 
the predicted precursor protein are labelled I- VI. The hydrophilic regions 

25 containing the putative cleavage sites for production of the 6kD PI species are 

arrowed. The regions corresponding to the peptides that would be produced 
by cleavage at these sites are marked C for chymotrypsin inhibitor, T for 
trypsin inhibitor and x for the two flanking peptides. 

(b) Hydropathyprofile of the potato PI K protein. (Sanchez-Serrano et aL f 1986). 
30 The two repeated domains in the PI II protein are labelled I and EL The 

putative cleavage sites for production of PCM are arrowed (Hass et aL t 1982) 
and the region spanned by PCM is marked. 
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(c) Hydropathy profile of the polypeptide encoded by the tomato PI II cDNA. 
(Graham et aL, 1985). The two domains are labelled, I and II and the 
residues which would be potential processing sites are arrowed. These sites 
are not present in regions predicted to be hydrophilic and consequently a 
5 cleavage product is not marked. 

Figure 8 shows an immunoblot analysis of the PI protein in stigmas of developing 
flowers. 

(a) Developing flowers of N. alata 
10 (b) SDS-PAGE of stigma proteins at the stages of development shown in (a) 5 
pg of each extract was loaded. The peptide gel was silver stained and 
molecular weight markers (LKB Low Molecular weight and Pharmacia 
peptide markers) are in kilodakons. 

(c) Immunoblot of a gel identical to (b), probed with anti-PI antiserum. 

15 

Stigmas from developing flowers contain four proteins of approximately 42kD, 32kD, 
18kD and 6kD that bind to the anti-PI antibody. The 42kD and the 18kD 
components decrease in concentration as the flowers mature, while the 6kD PI 
protein reaches a maximum concentration just before anthesis. The level of the 
20 32kD component, which runs as a doublet, does not alter significantly during flower 
development. 

Figure 9 shows the separation and identification of the 6kD proteinase inhibitor 
species from N.alata sdgmas 
25 A. Separation of the 6kD Pis by reversed phase HPLC chromatography 

Four major peaks were obtained with retention times of about 15.5min(peakl), 
20J5min(peak2), 22.5min(peak3), 24min(peak4). The peptides in each peak have 
been identified by a combination of N-terminal analysis and mass spectrometry. See 
B for description of CI and T1-T4. 

30 
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B. The five homologous peptides produced from the PI precursor protein: CI, 
chymotrypsin inhibitor, T1-T4 trypsin inhibitors. The solid bars represent the 
reactive sites of the inhibitors. The precursor protein is drawn minus the signal 
5 sequence. ||§| region of the six repeated domains (amino acids 1-343, Fig.l). 

non-repeated sequence (amino acids 344-368, Fig.l). The arrows point to the 
processing jites in the precursor protein. 

C The amino acid sequence of CI and T1-T4 predicted from the cDNA clone and 
10 confirmed by N-terminal sequencing of the purified peptides. The amino acid at the 
carboxy-terminus of each peptide was obtained by accurate mass determination using 
an electro-spray mass spectrometer. The CI and Tl inhibitors differ by five amino 
acids (bold). Two of these amino acids are located at the reactive site (underlined) 
and the other two to three reside at the carboxy-terminus. Peptides T2-T4 have 
15 changes in three amino acids (boxed) that are conserved between CI and Tl. 
Peptides T2 and T3 are identical to each other. Mass spectrometry was used to 
demonstrate that other forms of CI and T1-T4 occur due to non-precise trimming 
at the N- and C-termini. That is, some forms are missing residue 1 or residue 53 
and others are missing both residue 1 and 53 (see Table 2). 

20 

Figure 10 shows the amino-acid sequence around the processing sites in the 
precursor PI protein. 

The sequence in bold is the amino-terminal sequence obtained from the purified PI 
protein. The sequence labelled with negative numbers is the flanking sequence 
25 predicted from the cDNA clone. The predicted precursor protein contains six 
repeats of this sequence. 

Figure 11 shows the PI precursor produced in a baculovirus expression system and 
the products obtained after digestion of the affinity purified PI precursor by the 
30 endoproteinase Asp-N. 
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A- The PI precursor produced by the recombinant baculovirus. 
Immunoblot containing affinity and HPLC purified PI precursor from Moicfa stigmas 
at the green bud stage of development (lane 1) and affinity purified PI precursor 
5 produced by the recombinant baculovirus (lane 2). Proteins were fractionated by 
electrophoresis on a 15% w/v SDS-polyacryl amide gel prior to electrophoretic 
transfer to nitrocellulose. The blot was incubated with the antibody raised in rabbits 
to the 6kD PI species from stigmas. The recombinant virus produced an 
immunorective protein of 42kD that is the same size as the PI precursor protein 
10 produced by stigmas (arrowed). 

B. Cleavage of the PI precursor by endoproteinase Asp-N. 

15% SDS-polyacrylamide gel stained with silver containing: 1, PI precursor, 
produced by baculovirus, incubated without enzyme. 2, enzyme incubated without 

15 precursor. 6kD, PI peptides of about 6kD purified from N.alata stigmas. 1m, 5m, 
30m, reaction products produced after 1, 5 and 30 minutes of incubation* 2h and 
24h, reaction products after 2 and 24h of incubation. Peptides of about 6-7kD were 
detected within one minute of incubation of the precursor with the enzyme. After 
24h only peptides of 6-7kD were detected. The bands smaller than 42kD in track 

20 1 are due to truncated forms of the precursor produced by premature termination 
of translation in the baculovirus expression system. 

Figure 12 Preparative chromatography by reversed phase HPLC of the peptides 
produced from the precursor by Asp-N digestion 

25 

HPLC profile of peptides produced by Asp-N digestion of the PI precursor. The 
major peaks had a retention time of 19 min (termed Asp-Nl) and 21 min (termed 
Asp-N2). Hie peptides in these peak fractions (1 & 2) had a slightly slower mobility 
on SDS-PAGE than the 6kD peptides from stigmas (C, inset). The proteinase 
30 inhibitory activity of Asp-Nl and Asp-N2 was tested against trypsin and 
chymotrypsin. 
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Figure 13 shows a comparison of the trypsin and chymotrypsin inhibition activity of 
the PI precursor, PI peptides from stigmas and in vitro produced PI peptides from 
the PI precursor. 

PI precursor or PI peptides (Ol.Ojjg) were tested for their ability to inhibit 1.0>ig of 
5 trypsin or chymotrypsin as described in the materials and methods. Inhibitory 
activity is expressed as the percentage of proteinase activity remaining after the 
proteinase had been preincubated with the PI with 100% remaining activity taken 
as the activity of the proteinase preincubated with no PI. Experiments were 
performed in duplicate and mean values were plotted. Deviation from the mean was 
10 8% or less. 



Figure 14 is a graphical representation showing a growth curve for T. commodus 
nymphs reared on control artificial diet, soybean Bowman-Birk inhibitor and N. alata 
PI. The vertical axis represents the mean weight of the crickets in each treatment 
15 (+ /- standard error) in mg. The horizontal axis represents the week number. The 
crickets reared on the N. alalaVl showed a lower mean weight than those reared on 
both the control diet and the diet containing the soybean inhibitor, throughout the 
experiment. 
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SUMMARY OF SEQ ID NOs 


SEQ ID NO. 1 


Nucleotide coding region of N. alata PI precursor 


SEQ ID NO. 2 


Full length nucleotide sequence of N. alata PI precursor 


SEQ ID NO. 3 


Amino acid sequence corresponding to SEQ ID NO. 1 


SEQ ID NO. 4 


Residues 1-24 of SEQ ID NO. 2 (peptide 1) 


SEQ ID NO. 5 


Residues 25-82 of SEQ ID NO. 2 (peptide 2) 


SEQ ID NO. o 


Residues 83-140 of SEQ ID NO. 2 (peptide 3) 


SEQ ID NO. 7 


Residues 141-198 of SEO ID NO 2 CoeDtide 4^ 


SEQ ID NO. 8 


Residues 199-256 of SEQ ED NO. 2 (peptide 5) 


SEQ ID NO. 9 


Residues 257-314 of SEQ ED NO. 2 (peptide 6) 


SEQ ID NO. 10 


Residues 315-368 of SEQ ID NO. 2 (peptide 7) 


SEQ ID NO. 11 


N-terminal amino acid sequence of 6kD PI protein 


SEQ ID NO. 12 


N-terminal amino acid sequence of 6kD PI protein 




EXAMPLE 1 




1. MATERIALS AND METHODS 


Plant Material 





25 Nicotiana alata (Link et Otto) plants of self-incompatibility genotype SjS 3 , S 3 S 3 and 
S 6 S 6 were maintained under standard glasshouse conditions as previously described 
(Anderson et al, 1989). Organs were collected directly into liquid Nitrogen to avoid 
induction of a wound response and stored at -70 ° until required. To study the effect 
of wounding on gene expression, leaves were wounded by crushing across the mid- 
30 vein with a dialysis clip. Leaves were collected 4 and 24 hours after wounding. 
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Identification and sequencing of a cDNA clone encoding PI 
Polyadenylated RNA was prepared from stigmas and styles, isolated from mature 
flowers of N. alata (genotype S^), and used to construct a cDNA library in Lambda 
gtlO (Anderson et al, 1989). Single stranded ^P-labelled cDNAwas prepared from 
mRNA from stigmas and styles of N. alaia (genotype S 3 S 3 and S^) and used to 
screen the library for highly expressed clones which were not S-genotype specific 
(Anderson et al, 1989). Plaques which hybridised strongly to cDNA probes from 
both S-genorypes were selected and assembled into groups on the basis of cross- 
hybridisation. The longest clone of each group was subcloned into M13mpl8 and 
pGEM 3zf+, and sequenced using an Applied Biosystems Model 373A automated 
sequencer. Both dye primer and dye terminator cycle sequencing chemistries were 
performed according to standard Applied Biosystems protocols. Consensus 
sequences were generated using SeqEd™ sequence editing software (Applied 
Biosystems). The GenBank database was searched for sequences homologous to 
these clones. Because of the high degree of sequence similarity between the six 
domains of the N. alaia PI clone, sequencing primers were made to non-repeated 3' 
sequences (nucleotides 1117-1137, 1188-1203 and 1247-1267), and to a 5' sequence 
before the start of the repetitive regions (nucleotides 74-98). In addition, the pNA- 
20 PI-2 insert was restricted with endonuclease /fadll, which cut at nucleotides 622 and 
970 to produce three fragments. The fragments were subcloned into pGEM7zf+ and 
sequenced in both directions, using the M13 forward and reverse primers. The 
repetitive nature of the pNA-PI-2 insert rendered it unstable in both phagemid and 
plasmid vectors when cultures were grown longer than 6 hours. 



15 



25 



30 



RNA Gel Blot Analysis 

Total RNA was isolated and separated on a 12% w/v agarose/formaldehyde gel as 
previous described (Anderson et al, 1989). The RNA was transferred to Hybond-N 
(Amersham) and probed with the insert from pNA-PI-2 labeUed with ^P using 
random hexanucleotides (1 x 10* cpm pg >; 1 x 10 7 cpm ml-i)(Feinberg and 
Vogelstein, 1983). Prehybridisation and hybridisation, at 68 »C, were as described 
by Anderson et al (1989). The filters were washed in 2 x SSC, 0.1% w/v SDS or 0.2 
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x SSC, 1% w/v SDS at 68 °C 

in rite hybridisation 

hybridisation was performed as described by Cornish et al y 1987. The probe 
5 was prepared by labelling the insert from pNA-PI-2 (lOOng) to a specific activity of 
10 8 cpm pg' 1 by random hexanucleotide priming (Feinberg and Vogelstein, 1983). 
The labelled probe was precipitated, and resuspended in hybridisation buffer (50yl), 
and 5]ol was applied to the sections. The sections were covered with coverslips, and 
incubated overnight at 40 °C in a closed box containing 50% v/v formamide. After 

10 incubation, sections were washed sequentially in 4 x SSC at room temperature, 2 x 
SSC at room temperature, and 1 x SSC at 40 °C for 40 min. The slides were dried 
and exposed direcdy to X-ray film (Cronex MRF 32, Dupont) at room temperature, 
overnight. Hybridised sections were counterstained with 0.025% w/v toluidine blue 
in H 2 0, and mounted in Eukitt (Carl Zeiss, Freilburg, FRG). Autoradiography were 

15 transposed over sections to give the composites shown. 

DNA Gel Blot Analysis 

Genomic DNA was isolated from young leaves of N. data by the procedure of 
Bernatzky and Tanksley (1986). DNA (lOpg) was digested to completion with the 
20 restriction endonncleases EcdRl or HindYLl, separated by electrophoresis on a 0.9% 
w/v agarose gel, and transferred to Hybond-N (Amersham) by wet blotting in 20 x 
SSC. Filters were probed and washed as described for RNA blot analysis. 

Preparation of protein extracts 

25 Soluble proteins were extracted from plant material by freezing the tissue in liquid 
N 2 , and grinding to a fine powder in a mortar and pestle. The powdered tissue was 
extracted in a buffer consisting of lOOmM Tris-HCl, pH 8.5, lOmM EDTA, 2mM 
CaCl^ 14yM p-mercaptoethanol. Insoluble material was removed by centrifugation 
at 10,000g for 15 min. Protein concentrations were estimated by the method of 

30 Bradford (1976) with Bovine Serum Albumin (BSA) as a standard. 
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Proteinase inhibition assays 

Protein extracts and purified protein were assayed for inhibitory activity against 
trypsin and chymotrypsin as described by Rickauer et al (1989). Inhibitory activity 
5 was measured against Ipgof trypsin (TPCK-treated; Sigma) or 3 pg of chymotrypsin 
(TLCK-treated; Sigma). The rate of hydrolysis of synthetic substrates N-a-P-tosyl-L- 
arginine methyl ester (TAME) and N-benzoyl-L-tyrosine ethyl ester (BTEE) by 
trypsin and chymotrypsin, respectively, were taken as the uninhib ited activity of the 
enzymes. Inhibitory activity of the extract was expressed as the percentage of control 
10 protease activity remaining after the protease had been pre-incubated with the 
extract. The PI peptides from stigma, PI precursor and Asp-N processed peptides 
were assayed for inhibitory activity as described by Christeller el al (1989). 

Purification of the N. data PI protein 

15 Stigmas (1000; lOg) were ground to a fine powder in liquid and extracted in 
buffer (lOOmM Tris-HCl, pH8.5, lOmM EDTA, 2mM CaCl^ 14pM p- 
mercaptoethanol, 4ml/g tissue). To concentrate the extract prior to the first 
purification step, gel filtration, the inhibitory activity was precipitated with 80% w/v 
amm onium sulphate, the concentration required to precipitate all the proteinase 

20 inhibitory activity. 

The ammonium sulphate pellet was resuspended in 5ml of 0.15M KC1, lOmM Tris- 
HC1, pH 8.1, and loaded onto a Sephadex G-50 column (2cm x 100cm) equilibrated 
with the same buffer. The fractions (10ml) eluted from this column and containing 

25 proteinase inhibitory activity were pooled and applied to an affinity column of 
Chymotrypsin-Sepharose CL4B [lOOmg TLCK-treated a-chymotrypsin (Sigma) cross- 
linked to 15ml Sepharose CL4B (Pharmacia) by manufacturers instructions]. The 
column was washed with 10 volumes of 0.15M KCl/lOmM Tris-HCl, pH 8.1, prior 
to elution of bound proteins with 7m urea, pH 3 (5 ml fractions). The eluate was 

30 neutralised immediately with 200 pi 1M Tris-HCl pH 8, and dialyzed extensively 
against deionised H 2 0. 
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Amino acid sequence analysis 

Purified PI protein was chromatographed on a reverse phase HPLC microbore 
column prior to automated Edman degradation on a gas phase sequencer (Mau et 
aL 9 1986). Phenylthiohydantoin (PTH) amino acids were analysed by HPLC as 
5 described by Grego et aL (1985). 

Production of a polyclonal antiserum to the N. alata PI 

The purified proteinase inhibitor (Figure 6c, lane 3) was conjugated to a carrier 
protein, keyhole limpet haemocyanin (KLH) (Sigma), using glutaraldehyde, as 

10 follows, lmg of PI protein was dissolved in 1.5ml H 2 0, and mixed with 0.3 mg KLH 
in 0.5 ml of 0.4M phosphate buffer, pH7.5. 1ml of 20mM glutaraldehyde was added 
dropwise over 5 min, with stirring at room temperature. The mixture was stirred for 
30 min at room temperature, 0.25ml of glycine was added, and the mixture was 
stirred for a further 30 min. The conjugated protein was then dialyzed extensively 

15 against normal saline (0.8% w/v NaCl). The equivalent of 100pg of PI protein was 
used for each injection. Freund's complete adjuvant was used for the first injection, 
and incomplete adjuvant for two subsequent booster injections. The IgG fraction of 
the antiserum was separated on Protein A Sepharose (Pharmacia) according to 
manufacturer's instructions. 

20 

Protein Gel Blot Analysis 

Protein extracts were electrophoresed in 15% w/v SDS-polyacrylamide gels 
(Laemmli, 1970) and transferred to nitrocellulose in 25mM Tris-HCl, 192mM glycine, 
20% v/v methanol, using a BioRad Trans-Blot R Semi-dry electrophoretic transfer cell 

25 (12V, 20 min). Loading and protein transfer were checked by staining the proteins 
on the membranes with Ponceau S (Harlow and Lane, 1988). Membranes were 
blocked in 3% w/v bovine serum albumin for lh, and incubated with the anti-PI 
antibody (2pg/ml in 1% w/v BSA, Tris Buffered Saline) overnight at room 
temperature. Bound antibody was detected using biotinylated donkey anti-rabbit IgG 

30 (1 / 500 dilution, Amersham) and the Amersham Biotin-Streptavidin system according 
to procedures recommended by the manufacturer. 
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Proteolysis of the PI precursor by endoproteinase Asp-N 

Affinity purified PI precursor (1.25mg) was incubated at 37°C with endoproteinase 
Asp-N (2yg) in lOOmM NH^CO^ pH 8.5 in a total volume of 1ml for 48h. 
Reaction products were separated by reversed-phase HPLC using an analytical 
5 Brownlee RP-300 Aquapore column (C8, 7pm, 4.6x1 00mm). The column was 
equilibrated in 0.1% v/v TFA and peptides were eluted with the following program: 
0-25%B (60% v/v acetonitrile in 0.089% v/v TFA) applied over 5min, followed by 
a gradient of 25-42%B over the next 40min, and ending with a gradient of 42-1 00%B 
over 5 minutes. The flow rate was l.Oml/min and peptides were detected by 
10 absorbance at 215nm. Each peak was collected manually and freeze dried. 
Concentration was estimated by response obtained with each peak on the UV 
detector at 215nm. 

2. CLONING OF PI PRECURSOR GENE 

15 

Isolation and characterisation of the PI cDNA clone 

A cDNA library, prepared from mRNA isolated from the stigmas and styles of 
mature flowers of N. alata, was screened for clones of highly expressed genes which 
were not associated with self-incompatibility genotype. Clones encoding a protein 

20 with some sequence identity to the type II proteinase inhibitors from potato and 
tomato (Thornburg ei aL, 1987; Graham el aL, 1985) were selected. The largest 
clone, NA-PI-2, is 1360 base pairs long with an open reading frame of 1191 
nucleotides. The nucleic acid sequence (SEQ ID NO. 2) and the predicted amino 
acid sequence (SEQ ID NO. 3) of the N. alata clone, NA-PI-2 is shown in Figure 1. 

25 There are no potential N-glycosylation sites. 

Surprisingly, the N. aia/acDNA clone encodes a protein with six repeated domains 
that have high, but not perfect, sequence identity (Figure 1). Each of these domains 
contains a potential reactive site which is highlighted in Figure 1. The residues at 
30 the putative reactives sites of the N. alata PI are consistent with the inhibitor having 
two sites which would specifically inhibit chymotrypsin (Leu5-Asn6, Leu63-Asn64) 
and four sites specific for trypsin (Argl21-Asnl22, Argl79-Asnl80, Arg237-Asn238 
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and Arg295-Asn296). 

To ensure that the repeat structure of NA-PI-2 was not due to a cloning artifact, 
three additional cDNA clones were sequenced, and found to be identical to NA-PI-2. 

5 

Table 1 is a comparison of the percentage amino acid identity of the six domains of 
the PI precursor. 

Temporal and spatial expression of the PI mRNA 
10 Total RNA, isolated from various tissues of N. olata, was probed with the PI cDNA 

clone in the RNA gel blot analyses shown in Figure 2. Two hybridising messages of 

1.0 and 1.4kb were present in total RNA isolated from styles (including stigmas). 

Only the larger message, which was predominant in this tissue, is of sufficient size 

to encode the cDNA clone NA-PI-2 (1.4Kb). The smaller message is not detected 
15 with the cDNA probe at higher stringency. An homologous message of 

approximately 1 .4kb was also present in RNA isolated from the styles of N. tabacum 

and N. sylvestris (Figure 2). 

In the other floral organs (except pollen), both messages were detectable at low 
20 levels, however, the smaller RNA species appeared more abundant. There was no 
hybridisation to pollen RNA. No hybridising species were evident in leaf RNA, but 
two species, 1.0 and 1.4kb were detected 24 hours after mechanical wounding. The 
smaller message (l.Okb) was more abundant in this case. 

25 In situ hybridisation of radiolabeled N. alata PI cDNA to longitudinal sections of 
styles from immature (1cm long) buds is shown in Figure 3. RNA homologous to 
the cDNA clone bound strongly to cells of the stigma and weakly to vascular 
bundles. No hybridisation was detected in the cortical tissue, transmitting tract 
tissue, or epidermis of the style. The same pattern of hybridisation was observed in 

30 mature receptive flowers. Control sections treated with ribonuclease A prior to 
hybridisation were not labelled. 
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Genomic DNA blot analysis 

The cDNA clone NA-PI-2, was used as a probe on the DNA gel blot shown in 
Figure 4 which contained genomic DNA digested with either EcdRl or Hinim. 
EaRI produced two hybridising fragments (llkb and 7.8kb) and HirdlU produced 
5 three large hybridising fragments (16.6, 13.5 and 10.5 kb). 

Distribution of PI activity in various tissues of N. alata 

The inhibition of trypsin and chymotrypsin by crude extracts of various organs of N. 
alata is shown in Figure 5. Scigma extract was the most effective inhibitor of both 
10 trypsin and chymotrypsin. The stigma extracts had up to eight times more inhibitory 
activity than sepal extracts, and more than 20 times more activity than extracts from 
styles, petals, leaves and wounded leaves. 

Purification of PI from N. alata stigmas 

15 Stigmas of N. alata were extracted in buffer and the inhibitory activity was 
concentrated by precipitation with 80% w/v ammonium sulphate. The precipitate 
was redissolved and fractionated by gel filtration on Sephadex G-50. Most of the 
protein in the extract eluted early in the profile illustrated in Figures 6a and 6b, 
relative to the proteinase inhibitor. Fractions with proteinase inhibitor activity were 

20 pooled and applied to an affinity column of chymotrypsin-Sepharose. The PI activity 
co-eluted with a protein of about 6kD, which appeared to migrate as a single band 
on the 20% SDS-polyacrylamide gel shown in Figure 6c. The purity of the PI at 
various stages of purification was assessed by SDS-PAGE (Figure 6c). The purified 
inhibitor represented approximately 50% of the inhibitory activity present in the 

25 crude extract. 

Amino acid sequence of the N-terminus of the 6kD PI protein 
The N-terminal amino acid sequence DRICTNCCAG(T/K)KG (SEQ ID NO. 11; 
SEQ ID NO. 12, respectively) was obtained from the purified PI protein. This 
30 sequence of amino acids corresponds to six regions in the deduced sequence of the 
cDNA clone, starting at positions 25, 83, 141, 199, 257 and 315 in Figure 1. At 
position 11 of the Nnerminal sequence, both threonine and lysine were detected. 
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This is consistent with the purified inhibitor comprising a mixture of six peptides 
beginning with the sequences underlined in Figure 1, as the first two peptides contain 
threonine at this position, while the other four peptides have lysine at this position 
The position of these peptides relative to the six repeated domains in the predicted 
5 precursor protein is illustrated in Figure 7. Five of the six predicted 6kD peptides, 
contain a reactive site for either chymotrypsin or trypsin (Figure 1 and 7). The sixth 
potential peptide is four amino-acids shorter than the other five peptides (fifty eight 
amino-acids) and may not be active, as it does not contain an inhibitory site. The 
peptide from the N-terminus (x in Figure 7) has a potential chymotrypsin reactive 
10 site but is much shorter (24 amino acids). 

Distribution of the PI protein in N. alala 

A polyclonal antiserum was raised to the purified PI protein conjugated to keyhole 
limpet haemocyanin. The antibody reacted strongly with the purified 6KD PI protein 

15 in inimunoblot analyses and bound only to a 6kD and a 32kD protein, which appears 
as a doublet, in total stigma and style extracts from mature flowers. Figure 8 is an 
immunoblot containing protein extracts of stigmas from flowers at different stages 
of development (1cm long buds to mature flowers) probed with the anti-PI 
antiserum. Larger cross reacting proteins of approximately 18kD, and 42kD were 

20 detected in buds from 1cm to 5cm in length in addition to the 6kD and the 32kD 
protein. The 18kD and 42kD proteins decreased in concentration with maturity, 
while the 6kD protein reached a peak concentration just before anthesis. The 
concentration of the 32kD protein remained relatively constant during flower 
maturation. 

25 
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TABLE 1 
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EXAMPLE 2 

PURIFICATION AND IDENTEFI CATION OF PI MONOMERS 
1. MATERIALS AND METHODS 

Separation of the 6kD PI species by reversed phase chromatography 
Stigmas (21,000) were ground and extracted as described for purification of the PI 
protein. After gel filtration on a Sephadex G-50 gel filtration column (5cmx800cm, 
3000 stigmas per separation) the peptides were lyophilized and applied to a 
Brownlee RP-300 C8 Reversed-phase column, 10x250mm, on a Beckman HPLC 
system Gold, and eluted with 0.1% v/v Trifluoroacetic acid (TFA) and an 
acetonitrile gradient (0-10% over Smins, 10-25% over 40 mins and 25-60% over 10 
mins), at 5ml/min. Peak fractions, designated fraction 1, 2, 3 and 4 were collected 
and freeze dried 

Electrospray mass spectrometry 

On line mass spectrometric analysis of HPLC eluates was performed by application 
of 20 pmoles of each PI preparation (fraction 1, 2, 3 & 4) in 2pl of vater onto a 
Brownlee RP-300 C8 reversed-phase column (150x0.20mm internal diameter fused- 
silica capillary column) on a modified Hewlett- Packard model HP1090L liquid 
chromatograph and elution with a linear gradient of acetonitrile ( 0,05% v/v TFA 
to 0.045% v/v TFA/60% v/v acetonitrile in 30 min.) at a flow rate of lpl/min and 
a column temperature of 25°C. The eluant was monitored at 215nm using a Spectral 
Physics forward optics scanning detector with a 6-mm pathlength U-shaped axial 
beam capillary flow cell (LC Packings, Netherlands). Mass spectra were acquired 
on a Finnigan-Mat triple quadrupole mass spectrometer (modelTSQ-700, San Jose, 
CA) equipped with an electrospray ionisation (ESI) source (Analytica, Branford, 
CT). The electrospray needle was operated in positive ion mode at a voltage 
differential of -4 kV. The sheath liquid was 2-methoxyethanol delivered at lpl/min 
via a syringe drive (Harvard Apparatus, South Natick, MA). The nitrogen drying gas 
conditions were as follows: heater temperature, 275°C; pressure, 15 psi; flow rate, - 
15stdL/min. The nitrogen sheath gas was supplied at 33 psi. Gaseous nitrogen was 
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obtained from a boiling liquid nitrogen source. Peptides were introduced into the 
ESI source at 1.0 ul/min by on-line capillary RP-HPLC as described above. Spectra 
were acquired scanning from m/z 400 to 2000 at a rate of 3sec. Data collection 
and reduction were performed on a Dec5100 computer using Finnigan BIOMASS™ 
software. 



2. RESULTS 



Separation and identification of the individual 6kD PI species from Njdat a stigmas 
The five of the six peptides of about 6kD that were predicted to be present in the 
purified 6kD PI preparation have been separated from each other by reversed-phase 
HPLC chromatography. Four peaks were obtained (Fig. 9a) and the peptides within 
each peak were identified by electrospray mass spectrometry (Table 2). The 
peptides have been designated CI, Tl, T2, T3 and T4 according to their position in 
the PI precursor and the presence of a chymotrypsin or trypsin reactive site (Fig 9b). 
The first HPLC peak (Fig 9a) corresponds to the chymotrypsin inhibitor CI, the 
second peak is composed of a mixture of T2 and T3 (identical to each other) and T4 
that differs from T2 and T3 by one amino-acid at position 32. The third peak 
contains the peptide Tl and the fourth peak is composed of a mixture of Tl, T2/T3 
and T4 (Table 2). 



The site of processing has not been precisely determined, but is likely to be located 
between the aspartate (N) and asparagine (D) residues in the sequence outlined in 
Figure 10. Proteases with specific requirements for asparagine residues have been 
isolated from vacuoles from immature soybean seeds and pumpkin cotyledons (Scott 
et aL, 1992, Hara-Nishimura et al„ 1991). This is consistent with the immunogold 
localization of the PI in the vacuoles of the papillae and the underlying secretory 
cells in the stigma of N.alata (Atkinson, 1992). In the case of the N.aiata PL 
processing analogous to that of peptide hormones is also possible because each of 
the possible 6kD peptides are flanked by dibasic residues (Lys-Lys, position -2 &-3 
in Figure 10). However, a system like this has not been described in plants, and it 
is more likely that the dibasic residues contribute to the predicted hydrophilic loops 
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that present the processing site on the surface of the molecule. 

The data from the mass spectrometric analysis shows that once the initial cleavage 
has occurred the new carboxy terminus is trimmed back (Figure 10). The EEKKN 
sequence (SEO ID NO. 14) is removed completely but the trimming is not precise, 
sometimes an additional amino acid is removed. Steric hindrance probably prevents 
further trimming. Occasionally the aspartate is also removed from the N -terminus. 

EXAMPLE 3 

Production of PI precursor in insect cell (Sf9) culture using a recombinant 
baculovirus vector. 

cDNA encoding the PI precursor (Figure 1) was inserted into the Eco Rl site of the 
plasmid vector pVL 1392, which is the same as pVL941 (Lucknow and Summers, 
1989) except that a multiple cloning site was inserted at the BamHl site. The 
plasmid designated pRHll, contains the PI cDNA in the correct orientation with 
respect to the direction of transcription directed by the polyhedrin promoter. 
Recombinant baculovirus was obtained by co-transfection of Spodoptera frugiperda 
cells with baculovirus DNA and pRHll. The recombinant viruses, produced by 
homologous recombination, were plaque purified and amplified prior to infection of 
insect cells for protein production. All procedures for production of recombinant 
baculovirus, titration of the virus and maintenance and infection of the Sf9 cells were 
obtained from King and Posse (1992). For production of the PI precursor, 
monolayers of Sf9 cells in large flasks (175cm 2 ) were infected at the time of 
confluence with an inoculum of high-titre recombinant virus at a multiplicity of 
infection of 5-10 pfu/cell. Culture fluid was collected after 4 days of infection, 
clarified by centrifugation and the PI precursor was purified by application to a 
Chymotrypsin-Sepharose affinity column as described for the 6kD PI species from 
stigmas. PI precursor eluted from the column in 7M urea, pH3 was neutralized 
immediately with 1M Tris-HCl buffer pH8, dialysed extensively against Milli-Q 
water, concentrated 20-50 fold by ultrafiltration using a Diaflow YM10 filter and 
stored frozen at -20°C. 
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The cDNA clone encoding the PI precursor was engineered into a baculovuirus 
vector for the production of the precursor from infected insect cells. The insect cells 
produced a 42kD protein that cross reacted with the antibodies raised to the 6kD PI 
peptides from stigma and bound to the chymotrypsin affinity column. This 42KD 
protein was identical in size to the 42kD precursor produced in the immature stigmas 
of NMata (Fig.ll) and had the N-terminal sequence LysAlaCysThrLeuAsn (SEQ 
ID NO. 13) demonstrating that the signal sequence had been processed correctly by 
the insect cells (Fig.l). Based on these results, the 42kD protein produced in the 
baculovirus expression system will now be referred to as the PI precursor. The 42kD 
PI precursor had inhibitory activity against chymotrypsin but no inhibitory activity 
against trypsin (Fig.13). Processing of the PI precursor by the endoproteinase AspN 
led to the production of stable peptides of about 6kD that were partially purified by 
reversed phase HPLC (Fig.12). These peptides have equivalent inhibitory activity 
against trypsin and chymotrypsin as the 6kD peptides isolated from stigma, indicating 
that processing of the precursor is required to activate the trypsin inhibitory activity 
but not all the chymotrypsin activity. Since AspN cleaves specifically adjacent to 
Aspartate residues (between Asn-1 and Aspl in Figure 10) and has no t rimming 
activity, the peptides produced in vitro will be similiar to those produced in stigmas 
except for the presence of the sequence EEKKN (SEQ ID NO. 14) at the C- 
terminus. This provides further evidence that precise processing of the N-and C- 
termini is not required to obtain an active 6kD PI peptide. Asp-Nl is more efficient 
at inhibiting chymotrypsin than trypsin and is thus likely to be predominantly a CI 
analogue (Fig.9b). Asp-N2 is a more efficient trypsin inhibitor and probably contains 
the T1-T4 analogues. 
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EXAMPLE 4 

Effect of Pis on protease activity in unlractionated gut extracts from various insects 
Activity of Pis on gut proteases was measured using the procedure of Christeller et 
al., (1992) as follows. An aliquot of luM of inhibitor ((MOvil, at least 5-fold excess 
over proteases present in the gut) was mixed with 150 pi of lOmM CAPS buffer, pH 
10, and preincubated with each insect gut extract (0-15pl), for 20 min at 30°C The 
reaction was started by the addition of 50\A of 14 C -labelled casein substrate (400pg 
protein, specific activity 25,000-75,000 dpm mg' 1 ) and continued for 30 min at 30°C 
until 50pl of cold 30% (w/v) TCA was added to terminate the reaction. After 
incubation on ice for 30 min, undigested protein was pelleted by centrifugation at 
20°C for 5 min at 10,000g. The supernatant was removed, mixed with scintillation 
fluid and the radioactivity measured. Assays were performed at pH 10 except for 
Lsericata and Cjitfifacies when lOmM Tris-HCl, pH 8,0 was used. 

Table 3 shows the inhibitory activity of the pooled 6kD PI peptides (CI, Tl, T2/T3, 
T4), the mixture of trypsin inhibitors T2/T3 and T4, and the chymotrypsin inhibitor 
CI against the proteases in the gut of various members of the Lepidoptera, 
Coleoptera, Orthoptera and Diptera. In most cases, the pooled peptides and the 
trypsin inhibitors had an equivalent effect against the gut proteases with the degree 
of inhibition ranging from 37-79% depending on the insect tested. The inhibitors 
had negligible effect on the gut proteases of the potato, tuber moth, P.opercullela 
The chymotrypsin inhibitor CI also affected the activity of the proteases but was less 
effective than the trypsin inhibitors in five cases ( W^cervinata, Ljerricata, 
Cjzealandica, P.octa, sugar cane grub). 

The experimental details are described in the legend to Figure 14. The N. alaia PI 
was more effective than Soybean Bowman-Birk inhibitor in reducing cricket weight. 
It has shown that there is a good correlation between the ability of a proteinase 
inhibitor to inhibit the enzymes of the insect midgut and its effectiveness in retarding 
the growth of insects in insect feeding trials (Christeller et al., 1992). Figure 14 
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shows that the pooled Pis that inhibited the gut proteases of the black field cricket 
(T.commodus) by 70% in the in vitro assay retarded the growth of the crickets by 
30% in a feeding trial conducted over a 10 week period. The correlation between 
in vitro assays and feeding trials has been confirmed recently by Johnston and 
collegues (1993) working on growth and development of Helicoverpa armigera 



_94l3810AlJ_;> 



WO 94/13810 



PCT/AU93/00659 



- 37 - 
TABLE 2 



I HPLC 
| peak 


retention time 
(min) 


molecular 
weight 


assigned peptide* 


1 1 


15.5 


5731.5 


CI 


I 




5644.4 


CI minus Ser 33 1 






5616.4 


CI minus Asp t & 






55.29.3 


CI minus Aspj 


i 


20.5 


5700.5 


T2/T3 






5728.5 


T4 








iz.1 \ o minus Asp j 







5613.5 


T4 minus Asp x 


3 


22.5 


5725.5 


Tl 






5610.5 


Tl minus Asp l 


4 


24 


5654.4 


Tl minus Ala^ 






5641.4 


T4 minus Ser^ 






5613.4 


T2/T3 minus Ser^ 






5539.4 


Tl minus Asp A & 
Ala^ [ 






5498.4 


T2/T3 minus Asp 1 & 






5526.4 


T4 minus Asp x & 

1 



* See Figure 9 for designation of CI and T1-T4. 
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TABLE3 

Effect of Nicotiana data proteinase inhibitors and Potato 
inhibitor II on casein hydrolysis by crude gut extracts 





casein hydrolysis (% control) 


Insect 


NaPI 




T4 


H. armigera 


33.2 


32.7 


30.3 


H. punctigera 


26.6 


29.3 


28.5 


T. commodus 


28.4 


35.0 


33.1 


II A. irtfusa 


37.5 


40.2 


43.3 


| sugar cane 
grub 


25.8 


43.9 


25.1 


W. cervinata 


22.9 


82.9 


20.4 


yyjsi vu ill fl/CL 


39.7 


45.4 


41.2 


S. litura 


28.1 


33.6 


24.8 


jP. opercullela 


95.8 


100 


98.5 


C. rufifacies 


29.1 


37.8 


28.9 


L. serricata 


59.2 


100 


63.0 


C. zealandica 


31.7 


54.7 


32.0 


P. octo 


57.1 


67.2 


57.4 


C. obliquana 


51.1 


49.1 


45.5 


A. tasmaniae 


283 


34.2 


39.5 
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Legend to Table 3 

NaPI = N. alata proteinase inhibitors pooled 

CI = N. alata chymotrypsin inhibitor (peak 1 from HPLC) 

T2/T3, T4 = N. alata trypsin inhibitors (peak 2 from HPLC) 

Heliothis armigera, Helicoverpa armigera, Tobacco budworm, Lepidoptera 

Heliothis punctigera, Helicoverpa punctigera Native budworm, Lepidoptera 

Teleogryilus commodus Black field cricket, Orthoptera 

Agrotis infusa Common cutworm, adults known as the Bogong moth, Lepidoptera 

Wiseana cervinata Porina, native to New Zealand, Lepidoptera 

Lucilla sericata Green blow fly, Diptera, assayed at pH8 

Chrysomya rupfacies Hairy maggot blow fly, Diptera, assayed at pH8 

Aphodius tasmaniae Tasmanian grass grub = Black-headed pasture cockchafer, 

Coleoptera 

Costelytra zealandica New Zealand grass grub, Coleoptera 
Spodoptera lUura Tropical armyworm, Lepidoptera 
Phthorimaea opercullela Potato tuber moth, Lepidoptera 
Epiphyas postvhtana Lightbrown apple moth (leafroller), Lepidoptera 
Planotortrix oclo Greenheaded leafroller, Lepidoptera 
Ctenopseustis obliquana Brownheaded leafroller, Lepidoptera 
Sugar cane grub 

Those skilled in the art will appreciate that the invention described herein is 
susceptible to variations and modifications other than those specifically described. 
It is to be understood that the invention includes all such variations and 
modifications. The invention also includes all of the steps, features, compositions 
and compounds referred to or indicated in this specification, individually or 
collectively, and any and all combinations of any two or more of said steps or 
features. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: (other than US) THE UNIVERSITY OF MELBOURNE 
(US only) ANDERSON, M A; ATKINSON, A H; 

HEATH, R L; and CLARKE, A E . 

(ii) TITLE OF INVENTION: A PROTEINASE INHIBITOR, PRECURSOR 

THEREOF AND GENETIC SEQUENCES ENCODING 
SAME 

(iii) NUMBER OF SEQUENCES: 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DAVIES COLLISON CAVE 

(B) STREET : 1 LITTLE COLLINS STREET 

(C) CITY: MELBOURNE 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRALIA 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1*0, Version #1,25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: AU PCT International 

(B) FILING DATE: 16-DEC-1993 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PL6399 - Australia 

(B) FILING DATE: 16-DEC-1992 

(viii) ATTORNEY / AGENT INFORMATION: 
(A) NAME: SLATTERY, JOHN M 

(C) REFERENCE / DOCKET NUMBER: EJH/JMS/EK 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (613) 254 2777 

(B) TELEFAX: (613) 254 2770 
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<2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AAGGCTTGTA CCTTAAACTG TGATCCAAGA ATTGCCTATG GAGTTTGCCC GCGTTCAGAA 60 

GAAAAGAAGA ATGATCGGAT ATGCACCAAC TGTTGCGCAG GCACGAAGGG TTGTAAGTAC 120 

TTCAGTGATG ATGGAACTTT TCTTTGTGAA CGAGAGTCTG ATCCTAGAAA TCCAAAGGCT 180 

TGTACCTTAA ACTGTGATCC AAGAATTGCC TATGGAGTTT GCCCGCGTTC AGAAGAAAAG 240 

AAGAATGATC GGATATGCAC CAACTGTTGC GCAGGCACGA AGGGTTGTAA GTACTTCAGT 300 

GATGATGGAA CTTTTGTTTG TGAAGGAGAG TCTGATCCTA GAAATCCAAA GGCTTGTCCT 360 

CGGAATTGCG ATCCAAGAAT TGCCTATGGG ATTTGCCCAC TTGCAGAAGA AAAGAAGAAT 420 

GATCGGATAT GCACCAACTG TTGCGCAGGC AAAAAGGGTT GTAAGTACTT TAGTGATGAT 480 

GGAACTTTTG TTTGTGAAGG AGAGTCTGAT CCTAAAAATC CAAAGGCCTG TCCTCGGAAT 540 

TGTGATGGAA GAATTGCCTA TGGGATTTGC CCACTTTCAG AAGAAAAGAA GAATGATCGG 600 

ATATGCACCA ACTGCTGCGC AGGCAAAAAG GGTTGTAAGT ACTTTAGTGA TGATGGAACT 660 

TTTGTTTGTG AAGGAGAGTC TGATCCTAAA AATCCAAAGG CTTGTCCTCG GAATTCTGAT 720 

GGAAGAATTG CCTATGGGAT TTGCCCACTT TCAGAAGAAA AGAAGAATGA TCGGATATGC 780 

ACAAACTGTT GCCCAGGCAA AAAGGGCTGT AAGTACTTTA GTGATGATGG AACTTTTGTT 840 

TGTGAAGGAG AGTCTGATCC TAGAAATCCA AAGGCCTGTC CTCGGAATTG TGATGGAAGA 900 

ATTGCCTATG GAATTTGCCC ACTTTCAGAA GAAAAGAAGA ATGATCGGAT ATGCACCAAT 960 

TGTTGCGCAG GCAACAAGGG CTGTAAGTAC TTTAGTGATG ATGGAACTTT TATTTGTGAA 1020 
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GGAGAATCTG AATATGCCAG CAAAGTGGAT GAATATGTTG GTGAACTGGA GAATGATCTC 1080 
CAGAAGTCTA AGGTTGCTGT TTCC 1104 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 97.. 1200 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CGAGTAAGTA TGGCTGTTCA CAGAGTTAGT TTCCTTGCTC TCCTCCTCTT ATTTGGAATG 

TCTCTGCTTG TAAGCAATGT GGAACATGCA GATGCC AAG GCT TGT ACC TTA AAC 

Lys Ala Cys Thr Leu Asn 
1 5 



60 
114 



TGT GAT CCA AGA ATT GCC TAT GGA GTT TGC CCG CGT TCA GAA GAA AAG 162 
Cys Asp Pro Arg He Ala Tyr Gly Val Cys Pro Arg Ser Glu Glu Lys 
10 15 20 

AAG AAT GAT CGG ATA TGC ACC AAC TGT TGC GCA GGC ACG AAG GGT TGT 210 
Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys 
25 30 35 

AAG TAC TTC AGT GAT GAT GGA ACT TTT GTT TGT GAA GGA GAG TCT GAT 258 
Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp 
40 45 50 

CCT AGA AAT CCA AAG GCT TGT ACC TTA AAC TGT GAT CCA AGA ATT GCC 306 
Pro Arg Asn Pro Lys Ala Cys Thr Leu Asn Cys Asp Pro Arg He Ala 
55 60 65 70 

TAT GGA GTT TGC CCG CGT TCA GAA GAA AAG AAG AAT GAT CGG ATA TGC 354 
Tyr Gly Val Cys Pro Arg Ser Glu Glu Lys Lys Asn Asp Arg He Cys 
75 80 85 



BNSDOC1D: <WO 84l38>0A>_l-> 



WO 94/13810 PCT/AU93/00659 

-46- 



ACC AAC TGT TGC GCA GGC ACG AAG GGT TGT AAG TAC TTC AGT GAT GAT 402 
Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys Lys Tyr Phe Ser Asp Asp 
90 95 100 

GGA ACT TTT GTT TGT GAA GGA GAG TCT GAT CCT AGA AAT CCA AAG GCT 450 
GLy Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg Asn Pro Lys Ala 
105 110 115 

TGT CCT CGG AAT TGC GAT CCA AGA ATT GCC TAT GGG ATT TGC CCA CTT 498 
Cys Pro Arg Asn Cys Asp Pro Arg lie Ala Tyr Gly He Cys Pro Leu 
120 125 130 

GCA GAA GAA AAG AAG AAT GAT CGG ATA TGC ACC AAC TGT TGC GCA GGC 546 
Ala Glu Glu Lys Lys Asn Asp Arg lie Cys Thr Asn Cys Cys Ala Gly 
135 140 145 150 

AAA AAG GGT TGT AAG TAC TTT AGT GAT GAT GGA ACT TTT GTT TGT GAA 594 
Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu 
155 160 165 

GGA GAG TCT GAT CCT AAA AAT CCA AAG GCC TGT CCT CGG AAT TGT GAT 642 
Gly Glu Ser Asp Pro Lys Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp 
170 175 180 

GGA AGA ATT GCC TAT GGG ATT TGC CCA CTT TCA GAA GAA AAG AAG AAT 690 
Gly Arg He Ala Tyr GLy He Cys Pro Leu Ser Glu Glu Lys Lys Asn 
185 190 195 

GAT CGG ATA TGC ACC AAC TGC TGC GCA GGC AAA AAG GGT TGT AAG TAC 738 
Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
200 205 210 

TTT AGT GAT GAT GGA ACT TTT GTT TGT GAA GGA GAG TCT GAT CCT AAA 786 
Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys 
215 220 225 230 

AAT CCA AAC GCT TGT CCT CGG AAT TGT GAT GGA AGA ATT GCC TAT GGG 834 
Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Cly 
235 240 245 

ATT TGC CCA CTT TCA GAA GAA AAG AAG AAT GAT CGG ATA TGC ACA AAC 882 
He Cys Pro Leu Ser Glu Glu Lys Lys Asn Asp Arg He Cys Thr Asn 
250 255 260 
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TGT TGC GCA GGC AAA AAG GGC TGT AAG TAC TTT ACT GAT GAT GGA ACT 930 
Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr 
265 270 275 

TTT GTT TGT GAA GGA GAG TCT GAT CCT AGA AAT CCA AAG GCC TGT CCT 978 
Phe Val Cys Glu Gly Glu Ser Asp Pro Arg Asn Pro Lys Ala Cys Pro 
280 285 290 

CGG AAT TGT GAT GGA AGA ATT GCC TAT GGA ATT TGC CCA CTT TCA GAA 1026 
Arg Asn Cys Asp Gly Arg lie Ala Tyr Gly He Cys Pro Leu Ser Glu 
295 300 305 310 

GAA AAG AAG AAT GAT CGG ATA TGC ACC AAT TGT TGC GCA GGC AAG AAG 1074 
Glu Lys Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys 
315 320 325 

GGC TGT AAG TAC TTT AGT GAT GAT GGA ACT TTT ATT TGT GAA GGA GAA 1122 
Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe He Cys Glu Gly Glu 
330 335 340 

TCT GAA TAT GCC AGC AAA GTG GAT GAA TAT GTT GGT GAA GTG GAG AAT 1170 
Ser Glu Tyr Ala Ser Lys Val Asp Glu Tyr Val Gly Glu Val Glu Asn 
345 350 355 

GAT CTC CAG AAG TCT AAG GTT GCT GTT TCC TAAGTCCTAA CTAATAATAT 1220 
Asp Leu Gin Lys Ser Lys Val Ala Val Ser 
360 365 

GTAGTCTATG TATGAAACAA AGGCATGCCA ATATGCTCTG TCTTGCCTGT AATCTGTAAT 1280 

ATGGTAGTGG AGCTTTTCCA CTGCCTGTTT AATAAGAAAT GG AGC AC TAG TTTGTTTTAG 1340 

TTAAAAAAAA AAAAAAAAAA 1360 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 368 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Lys Ala Cys Thr Leu Asn Cys Asp Pro Arg lie Ala Tyr Gly Val Cys 
15 10 15 

Pro Arg Ser Glu Glu Lys Lys Asn Asp Arg lie Cys Thr Asn Cys Cys 
20 25 30 

Ala Gly Thr Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val 
35 40 45 

Cys Clu Gly Glu Ser Asp Pro Arg Asn Pro Lys Ala Cys Thr Leu Asn 
50 55 60 

Cys Asp Pro Arg lie Ala Tyr Gly Val Cys Pro Arg Ser Glu Glu Lys 
65 70 75 80 

Lys Asn Asp Arg He Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys 
85 90 95 

Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp 
100 105 110 

Pro Arg Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Pro Arg He Ala 
115 120 125 

Tyr Gly He Cys Pro Leu Ala Glu Glu Lys Lys Asn Asp Arg He Cys 
130 135 140 

Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp 
145 150 155 160 

Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys Asn Pro Lys Ala 
165 170 175 

Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly He Cys Pro Leu 
180 185 190 
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Ser Glu Glu Lys Lys Asn Asp Arg lie Cys Thr Asn Cys Cys Ala Gly 
195 200 205 

Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr Phe Val Cys Glu 
210 215 220 

Gly Glu Ser Asp Pro Lys Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp 
225 230 235 240 

Gly Arg lie Ala Tyr Gly lie Cys Pro Leu Ser Glu Glu Lys Lys Asn 
245 250 255 

Asp Arg lie Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
260 265 270 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg 
275 280 285 

Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg lie Ala Tyr Gly 
290 295 300 

lie Cys Pro Leu Ser Glu Glu Lys Lys Asn Asp Arg lie Cys Thr Asn 
305 310 315 320 

Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr Phe Ser Asp Asp Gly Thr 
325 330 335 

Phe lie Cys Glu Gly Glu Ser Glu Tyr Ala Ser Lys Val Asp Glu Tyr 
340 345 350 

Val Gly Glu Val Glu Asn Asp Leu Gin Lys Ser Lys Val Ala Val Ser 
355 360 365 
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(2) INFORMATION FOR SEQ ID NO: A: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Lys Ala Cys Thr Leu Asn Cys Asp Pro Arg lie Ala Tyr Gly Val Cys 
1 5 10 15 

Pro Arg Ser Glu Glu Lys Lys Asn 
20 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Asp Arg lie Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys Lys Tyr 
1 5 10 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg 
20 25 30 

Asn Pro Lys Ala Cys Thr Leu Asn Cys Asp Pro Arg He Ala Tyr Gly 
35 40 45 

Val Cys Pro Arg Ser Glu Glu Lys Lys Asn 
50 55 
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(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOCY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Asp Arg lie Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly Cys Lys Tyr 
1 5 10 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg 
20 25 30 

Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Pro Arg He Ala Tyr Gly 
35 AO 45 

He Cys Pro Leu Ala Glu Glu Lys Lys Asn 
50 55 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp Arg lie Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
1 5 io 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys 
20 25 3Q 

Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly 
35 40 A5 

He Cys Pro Leu Ser Glu Glu Lys Lys Asn 
50 55 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Asp Arg lie Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
1 5 10 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Lys 
20 25 30 

Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly 
35 40 45 

He Cys Pro Leu Ser Glu Glu Lys Lys Asn 
50 55 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
1 5 10 15 

Phe Ser Asp Asp Gly Thr Phe Val Cys Glu Gly Glu Ser Asp Pro Arg 
20 25 30 

Asn Pro Lys Ala Cys Pro Arg Asn Cys Asp Gly Arg He Ala Tyr Gly 
35 40 45 

He Cys Pro Leu Ser Glu Glu Lys Lys Asn 
50 55 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Arg lie Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly Cys Lys Tyr 
1 5 10 15 

Phe Ser Asp Asp Gly Thr Phe He Cys Glu Gly Glu Ser Glu Tyr Ala 
20 25 30 

Ser Lys Val Asp Glu Tyr Val Gly Glu Val Glu Asn Asp Leu Gin Lys 
35 40 45 

Ser Lys Val Ala Val Ser 
50 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asp Arg lie Cys Thr Asn Cys Cys Ala Gly Thr Lys Gly 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Asp Arg He Cys Thr Asn Cys Cys Ala Gly Lys Lys Gly 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: singLe 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Lys Ala Cys Thr Leu Asn 
1 5 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Glu Glu Lys Lys Asn 
1 5 
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CLAIMS: 

1 . A nucleic acid isolate comprising a sequence of nucleotides which encodes 
or is complementary to a sequence which encodes a type II serine proteinase 
inhibitor (PI) precursor from a plant wherein said precursor comprises at least three 
PI monomers and wherein at least one of said monomers has a chymotrypsin specific 
site and at least one other of said monomers has a trypsin specific site. 

2. A nucleic acid isolate according to claim 1 wherein said PI precursor 
comprises at least four monomers. 

3. A nucleic acid isolate according to claim 1 wherein the PI precursor 
comprises at least five monomers. 

4. A nucleic acid isolate according to claim 1 wherein the PI precursor 
comprises at least six monomers. 

5. A nucleic acid isolate according to claim 1 wherein said isolate comprises 
a sequence of nucleotides as set forth in SEQ ID NO. 1 or having at least 55% 
nucleotide similarity to all or part thereof. 

6. A nucleic acid isolate according to claim 1 or 5 wherein said nucleic acid 
isolate is capable of hybridising under low stringency conditions to a complementary 
sequence to SEQ ID NO. 1. 

7. A nucleic acid isolate comprising a sequence of nucleotides which encodes 
or is complementary to a sequence which encodes a single type II serine PI having 
either a chymotrypsin specific site or a trypsin specific site and wherein said PI is a 
monomer of a precursor PI having at least three monomers of which at least one of 
said monomers has a chymotrypsin site and the other of said monomers has a trypsin 
site. 
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8. A nucleic acid isolate according to claim 7 comprising a sequence of 
nucleotides which is at least 55% similar to all or part of SEQ ID NO. 1. 

9. A nucleic acid isolate according to claim 7 which is capable of hybridising 
under low stringency conditions to a complementary nucleotide sequence to SEQ ID 
No. 1. 

10. A nucleic acid isolate according to claim 7 or 8 or 9 comprising a 
nucleotide sequence which encodes a peptide selected from (SEQ ID NO. 5); (SEQ 
ID NO. 6); (SEQ ID NO. 7); (SEQ ID NO. 8); (SEQ ID NO. 9); (SEQ ID NO. 
10). 

11. A nucleic acid isolate according to claim 7 or 8 or 9 comprising a 
nucleotide sequence which encodes a peptide defined by SEQ ID NO. 4. 

12. A recombinant type II serine PI precursor from a plant wherein said 
precursor comprises at least three PI monomers and wherein at least one of said 
monomers has a chymotrypsin site and at least one other of said monomers has a 
trypsin specific site. 

13. A recombinant PI precursor according to claim 12 wherein said PI 
precursor comprises at least four monomers. 

14. A recombinant PI precursor according to claim 12 wherein said PI 
precursor comprises at least five monomers. 

15. A recombinant PI precursor according to claim 12 wherein said PI 
precursor comprises at least six monomers. 

16. A recombinant PI precursor according to claim 12 wherein said PI 
precursor comprises a sequence of amino acids as set forth in SEQ ID NO. 3 or 
having at least 55% similarity to all or part thereof. 
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17. A monomer of the recombinant PI according to claim 12. 

18. A monomer according to claim 17 selected from the list consisting of 
amino acid residues 25-82 (SEQ ID NO. 5); amino acid residues 83-140 (SEQ ID 
NO. 6); amino acid residues 141-198 (SEQ ID NO. 7); amino acid residues 199-256 
(SEQ ID NO. 8); amino acid residues 257-314 (SEQ ID NO. 9); and amino acid 
residues 315-368 (SEQ ID NO. 10); of the amino acid sequence set forth in Figure 
1 (SEQ ID NO. 3). 

19. A monomer according to claim 17 defined by the amino acid residues 1 
to 24 (SEQ ID NO. 4) of the amino acid sequence set forth in Figure 1 (SEQ ID 
NO. 3). 

20. A protease sensitive peptide comprising the amino acid sequence: 

R 1 -X 1 -X 2 -Asn-Asp-R 2 

wherein X x and X 2 are preferably the same and are preferably both Lys residues and 
wherein Rj and R 2 may be the same or different and each is a D or L amino acid, 
a peptide, a polypeptide, a protein, or an alkyl, substituted alkyl, alkenyl, substituted 
alkenyl, acyl, dienyl, arylalkyl, arylalkenyl, aryl, substituted aryl, heterocyclic, 
substituted heterocyclic, cycloalkyl, substituted cycloalkyl, halo, haloalkyl, nitro, 
hydroxy, thiol, sulfonyl, carboxy, alkoxy, aryloxy and alkyl aryloxy group and the like. 

21. A protease sensitive peptide according to claim 21 wherein R A and R 2 may 
be the same or different and each is a peptide or polypeptide and X! and X 2 are 
each Lys. 

22. A protease sensitive peptide according to claim 20 or 21 in recombinant 
ot synthetic form. 
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23. A nucleic acid molecule encoding the protease sensitive peptide according 
to claim 22. 

24. A genetic construct comprising a nucleic acid molecule comprising a 
sequence of nucleotides which encodes or is complementary to a sequence which 
encodes a type II serine PI precursor or monomer thereof from a plant wherein said 
precursor comprises at least three PI monomers and wherein at least one of said 
monomers has a chymotrypsin specific site and at least one of said other monomers 
has a trypsin specific site and wherein said genetic sequence further comprises 
expression means to permit expression of said nucleic acid molecule, replication 
means to permit replication in a plant cell or, alternatively, integration means, to 
permit stable integration of said nucleic acid molecule into a plant cell genome. 

25. A transgenic plant carrying a genetic construct, said genetic construct 
comprising a deoxyribonucleic acid molecule which encodes a type II serine PI or 
monomer thereof, wherein said precursor comprises a sequence of nucleotides which 
encodes or is complementary to a sequence which encodes a type II serine 
proteinase inhibitor (PI) precursor from a plant wherein said precursor comprises at 
least three PI monomers and wherein at least one of said monomers has a 
chymotrypsin specific site and at least one other of said monomers has a trypsin 
specific site. 

26. A transgenic plant according to claim 25 which produces one or more PI 
monomers selected from the listing consisting of amino acid residues 25-82 (SEQ ID 
NO. 5); amino acid residues 83-140 (SEQ ID NO. 6); amino acid residues 141-198 
(SEQ ID NO. 7); amino add residues 199-256 (SEQ ID NO. 8); amino acid 
residues 257-314 (SEQ ID NO. 9); and amino acid residues 315-368 (SEQ ID NO. 
10) of the amino acid sequence set forth in Figure 1 (SEQ H> NO. 3). 
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27. A transgenic plant according to claim 25 which produces a PI monomer 
consisting of amino acid residues 1-24 (SEQ ID NO. 4) of the amino acid sequence 
set forth in SEQ ID NO. 3. 

28. A method of increasing, enhancing or otherwise facilitating resistance of 
a plant to insect or other pathogen infestation, said method comprising introducing 
a nucleic acid molecule as defined in claim 1 or 7 or 10 or 11 into a cell or group 
of cells of said plant, regenerating a plant therefrom and growing said plant for a 
time and under conditions sufficient to permit expression of said nucleic acid 
molecule into a PI or precursor thereof capable of inhibiting growth and /or 
infestation by said pathogen. 
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Box II Observations where unity of Invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: ____ _ 

S&T!^ proteinase inhibitor from plants, nucleic acid sequences coding therefore and 

deavage %ypu>t£ase$ t0 * protease seasitive Peptide which comprises a specific amine acid sequence which is sensitive to 



□ As all required additional search fees were timely paid by the appiicant, this international 
search report covers all searchable clauns 

□ As all searchable claims could be searched without effort justifying an additional fee, this 
Authority did not invite payment of any additional fee. 

□ As only some of the required additional search fees were timely paid by the applicant, this 
international search report covers only those claims for which fees were paid, specifically 



claims Nos.: 
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No required additional search fees were timely paid bv the applicant. Conseouently this 
international search report is restricted to the invention first mentioned in the claims; 
it is covered by claims Nos,: 



1-19, 24-28 



The additional search fees were accompanied by the applicant's protest. 
No protest accompanied the payment of additional search fees. 
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